Identification and analysis of ancestral hominoid transcriptome inferred from cross-species transcript and processed pseudogene comparisons.

نویسندگان

  • Yao-Ting Huang
  • Feng-Chi Chen
  • Chiuan-Jung Chen
  • Hsin-Liang Chen
  • Trees-Juen Chuang
چکیده

Comparative transcriptomics studies in hominoids are difficult because of lack of EST information in the great apes. Nevertheless, processed pseudogenes (PPGs), which are reverse-transcribed ancient transcripts present in the current genome, can be regarded as a virtual transcript resource that may compensate for the paucity of ESTs in non-human hominoids. Here we show that chimpanzee PPGs can be applied to identification of novel human exons/alternatively spliced variants (ASVs) and inference of the ancestral hominoid transcriptome and chimpanzee exon loss events. We develop a method for comparatively extracting novel transcripts from PPGs (designated "CENTP") and identify 643 novel human exons/ASVs. RT-PCR-sequencing experiments confirmed >50% of the tested exons/ASVs, supporting the effectiveness of the CENTP pipeline. With reference to the ancestral transcriptome inferred by CENTP, 47 chimpanzee exon loss events are identified. Furthermore, by combining out-group and PPG information, we identify 20 chimpanzee-specific exon loss and 10 human-specific exon gain events. We also demonstrate that the ancestral transcriptome and exon loss/gain events inferred based on comparisons of current transcripts may be incomplete (or occasionally inappropriate) because ancestral transcripts may not be represented in the ESTs of existing species. Finally, functional analysis reveals that the novel exons identified based on chimpanzee transcripts are significantly enriched in genes related to translation regulatory activity and viral life cycle, suggesting different expression levels of the associated transcripts, and thus divergent splicing isoform composition between human and chimpanzee in these functional categories.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hominoid-Specific De Novo Protein-Coding Genes Originating from Long Non-Coding RNAs

Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. ...

متن کامل

Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)

Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally.Objective: In the present study, we report the results of a systemic search for identifi cation of new miRNAs in B. rapa using homology-based ...

متن کامل

I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing

Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...

متن کامل

Genomic fossils as a snapshot of the human transcriptome.

Processed pseudogenes (PPGs) are cDNA sequences that were generated through reverse transcription of mature, spliced mRNAs and have subsequently been reinserted at a new genomic location. These cDNA sequences are usually no longer transcribed and are considered "dead on arrival." Here we show that PPGs can be used to generate a map of the transcriptome. By analyzing thousands of human PPGs, we ...

متن کامل

Microsatellite Analysis for Differentiation and Identification of the Source Tree of Fagus orientalis Lipsky

The present study describes approaches for the identification of individual beech trees using maternal tissues from their seeds or fruits. Four microsatellite markers were used for genetic analysis of seedlots from Fagus orientalis Lipsky, a highly out-crossing tree species. Seeds from 11 single-tree harvests belonging to one population, (7 seeds from each), as well as non-paranchymatic materna...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Genome research

دوره 18 7  شماره 

صفحات  -

تاریخ انتشار 2008